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We  consider  the  problem  of  distributed  estimation  in  a  power  constrained  collaborative  wireless 
sensor  network,  where  the  network  is  divided  into  a  set  of  sensor  clusters,  with  collaboration 
allowed  among  sensors  within  the  same  cluster  but  not  across  clusters.  Specifically,  each  cluster 
forms  one  or  multiple  local  messages  via  sensor  collaboration  (in  particular,  linear  operation  is 
considered)  and  transmits  the  messages  over  noisy  channels  to  a  fusion  center.  The  final  estimate 
is  constructed  at  the  fusion  center  based  on  the  noisy  data  received  from  all  clusters.  In  this 
collaborative  setup,  we  study  the  following  fundamental  problems.  Given  a  total  transmit 
power  constraint,  shall  we  transmit  the  raw  data  or  some  low- dimensional  local  messages  for 
each  cluster^  What  is  the  optimal  collaboration  scheme  for  each  cluster"?  How  do  we  optimally 
allocate  the  power  among  different  clusters?  These  questions  are  addressed  in  this  article.  We 
will  show  that  the  optimum  collaboration  strategy  is  to  compress  the  data  into  one  local  message 
that,  depending  on  the  channel  characteristics,  is  transmitted  using  one  or  multiple  available 
channels  to  the  fusion  center.  The  optimal  power  allocation  among  the  clusters  is  also 
investigated,  which  yields  a  water-filling  type  of  scheme. 

Key  words:  Distributed  estimation;  wireless  sensor  network;  sensor  clusters;  power 
allocation;  collaboration  strategy;  data  transmission;  estimation  distortion. 


Distributed  estimation  has  attracted 
much  attention  recently.  One  of  the 
network  architectures  for  distributed 
estimation  involves  a  set  of  spatially 
distributed  sensors  linked  with  a 
fusion  center  (FC).  Each  sensor  makes  a  noisy 
observation  of  the  phenomena  of  interest  and  transmits 
its  processed  information  to  the  FC,  where  a  final 
estimate  is  formed.  The  problem  of  optimal  power 
allocation  among  sensors  given  a  total  transmit  power 
constraint  was  considered  in  Cui  et  al.  (2007),  Li  and 
AlRegib  (2007),  Wu,  Huang,  and  Lee  (2008),  and 
Xiao  et  al.  (2006);  the  goal  was  to  minimize  the 
estimation  distortion  at  the  FC.  For  most  of  these 


works,  intersensor  communication  is  not  considered. 
Intersensor  collaboration  can  indeed  be  exploited  to 
enhance  transmission  energy  efficiency  and  improve 
system  performance. 

In  this  article,  we  consider  distributed  estimation  in 
a  hierarchical  network  architecture  with  localized 
collaboration.  Specifically,  we  assume  that  the  network 
is  divided  into  a  number  of  sensor  clusters  linked  with  a 
FC.  The  sensors  within  the  same  cluster  have  the 
communication  resources  to  locally  collaborate,  where¬ 
as  no  collaboration  is  allowed  across  clusters.  This 
might  be  the  case  for  scenarios  where  multiple  sets  of 
sensors  are  spatially  distributed,  with  each  set  of 
sensors  within  a  small  neighborhood.  Each  cluster 
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Sensor  clusters 


Figure  1.  Collaborative  setting:  the  network  is  divided  into  a  number  of  sensor  clusters.  Sensors  within  each  cluster  can  collaborate  to 
convert  their  noisy  observations  {Xm}  into  some  local  messages:  {Zm/- 


then  transmits  one  or  multiple  one-dimensional 
messages,  which  could  be  the  raw  data  or  obtained 
via  sensor  collaboration  over  noisy  channels  to  the  FC 
where  a  final  estimate  is  formed  based  on  the  data 
received  from  all  clusters.  In  this  context,  the  following 
natural  questions  arise:  Given  a  fixed  amount  of  total 
transmit  power,  how  should  each  cluster  process  its 
local  measurements  such  that  a  minimum  estimation 
distortion  can  be  achieved  at  the  FC?  How  should  we 
allocate  the  power  among  the  different  clusters  in  an 
optimal  power-distortion  fashion?  These  questions  are 
be  addressed  in  this  article,  and  we  develop  a 
fundamental  understanding  of  this  important  hierar¬ 
chical  collaborative  strategy  for  distributed  estimation. 
Our  work  is  closely  related  to  the  distributed 
compression-estimation  approaches  in  Fang  and  Li 
(2008),  Luo,  Giannakis,  and  Zhang  (2005),  Schizas, 
Giannakis,  and  Luo  (2007),  Song,  Zhu,  and  Zhou 
(2005),  Zhang  et  al.  (2003),  and  Zhu  et  al.  (2005); 
their  objective  is  to  reduce  the  transmission  require¬ 
ments  via  dimensionality  reduction.  While  sharing 
certain  similarities  with  the  distributed  compression- 
estimation  approaches,  our  work  focuses  on  the 
optimal  collaboration  among  sensors  in  a  power 
constrained  scenario. 

System  model  and  problem  formulation 

We  consider  a  wireless  sensor  network  consisting  of 
N  spatially  distributed  sensors,  with  each  sensor 
making  a  noisy  observation  of  an  unknown  random 
parameter  6:  x  =  h„Q  +  w„,  where  denotes  the 
observation  gain  and  w„  denotes  the  additive  observa¬ 
tion  noise.  The  sensors  in  the  network  are  divided  into 


M  sensor  clusters  (Figure  1 ).  Each  cluster,  say  cluster 
m,  consists  of  closely  located  sensors.  The  sensors 
in  each  cluster  are  able  to  collaborate  to  form  local 
messages  that  are  sent  to  the  FC,  whereas  no 
communication  is  allowed  across  different  clusters. 
The  objective  is  to  obtain  an  estimate  of  the  unknown 
parameter  at  the  FC  based  on  the  information  received 
from  the  clusters.  In  practice,  the  sensor  collaboration 
can  be  easily  implemented.  For  each  cluster,  we  choose 
one  sensor  to  be  the  cluster  head  whose  task  is  to 
collect  the  data  from  other  sensors  within  the  same 
cluster  and  carry  out  the  collaborative  processing.  The 
resultant  local  messages  are  then  transmitted  by  the 
cluster  head  to  the  FC.  We  adopt  the  following 
assumptions  for  this  collaborative  setting. 

Al:  The  links  between  sensors  and  the  cluster  head 
within  each  cluster  are  ideal.  Sensor  collaboration  is 
confined  to  linear  operations. 

A2:  An  uncoded  analog  amplify-and-forward 
scheme  is  employed  to  transmit  the  local  messages 
from  the  cluster  heads  to  the  FC  over  noisy,  wireless 
channels. 

For  notational  convenience,  we  use  x„  „  to  denote  the 
sensor  measurement  of  sensor  n  in  cluster  m,  where  n  e 
{1,...,  iVj,  m  e  and 

“1“  (1) 

in  which  h^.n  and  "^m,n  denote  the  corresponding 
observation  gain  and  additive  observation  noise, 
respectively.  To  capture  the  cluster-based  collabora¬ 
tive  scenario,  we  write  the  measurements  within  a 
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cluster  in  a  vector  form: 
which  is  given  by 

(2) 

with  hfji  \bm,\  ^m,2  ■  •  ■  and 

. . .  ^ .  The  local  messages  via  sensor  collaboration 

within  each  cluster  can  therefore  be  expressed  as 

^TTi^rn^  (d) 

where  denotes  the  collaboration  matrix  for 

cluster  m,  is  the  dimensionality  of  the  message 

vector  whose  choice  is  discussed  later.  The  signal 
received  at  the  FC  from  the  iwth  cluster  is  given  by 


into  the  collaboration  matrices  [C„],  the  overall 
collaboration  matrix  C  has  to  satisfy  a  total  transmit 
power  constraint.  Specifically,  using  a  Linear  Mini¬ 
mum  Mean-Square  Error  (LMMSE)  estimator  (Kay 
1993),  it  can  be  readily  verified  that  we  are  faced  with 
the  following  optimization  problem: 

min  £[(0-0)^]  =  -  a^h^C^(CR,,C^  +  R^) ^ ^Ch 

s.t.  tr(CR,,C^)<P,  (6) 

where  denotes  the  signal  variance;  R,^  =  2s  [xx^]; 
tr(CR,.C^  is  the  average  transmit  power  required  to 
send  the  local  messages  from  all  clusters  to  the  EC;  and 
P  is  a  prespecified  power  budget  for  transmission. 


ym  Cr,,2A,„C,„X,„  “t"  V;„,  (4) 

where  6  denotes  a  fading  multiplicative 

channel  matrix,  which  can  be  diagonal  or  nondiag¬ 
onal,  depending  on  the  transmission  scheme  (e.g., 
orthogonal  vs.  nonorthogonal  channel  access);  = 
diagjfli,  ...,  ap^  is  the  amplification  matrix  with  a,- 
denoting  the  amplification  factor  used  in  transmitting 
the  ith  message  of  z„,;  denotes  the  additive 

channel  noise  vector.  Without  loss  of  generality,  we 
assume  G„  =  I  and  =  I,  where  I  denotes  the 
identity  matrix,  because  the  multiplicative  effect  of  the 
channel  matrix  can  be  removed  by  carrying  out  a  matrix 
inverse  using  an  estimate  of  the  channel  matrix  G„  at 
the  receiver  and  the  amplification  matrix  A„  can  be 
absorbed  into  C^.  We  have  the  following  assumption 
regarding  observation  noise  {w„,}  and  channel  noise 

[vj. 

A3:  Noise  {w,„}  and  {v„]  are  zero  mean  with  positive 
definite  autocovariance  [R™,J  and  respec¬ 

tively,  which  are  available  at  the  FC.  The  noise 
across  different  clusters  is  mutually  uncorrelated,  i.e., 
=  0  and  E[\i\j'^]  =  0  Vi  A  j- 

Let  y  =  [yi  yz  ■■.  yA/]^  denote  a  column  vector 
formed  by  stacking  the  data  received  from  all  clusters. 
We  have 


y  =  Cx-|-v  =  C(h0-|-w)-|-v,  (5) 

where  C  =  diag{Ci,  . . .,  Cm)  is  a  block  diagonal  matrix 
with  its  OTth  block-diagonal  element  equal  to  C„,  x  = 
[xi  X2  ...  xm]^,  V  =  [vi  V2  ...  ymV ,  h  =  [hi  h2  ... 
h^]^!  and  w  =  [wi  W2  ...  Wa^]^.  A  natural  question 
arising  from  this  scenario  is  to  find  an  overall  optimal 
collaboration  matrix  C,  or  equivalently,  a  set  of 
individual  collaboration  matrices  to  achieve 

a  minimum  estimation  distortion  at  the  FC.  Also, 
because  the  amplification  factors  are  incorporated 


Single  cluster  case 

The  development  of  the  optimal  collaboration 
matrix  for  the  single  cluster  case  is  quite  involved. 
Because  of  space  limitations,  we  only  present  the  main 
results  without  providing  the  proof. 

Theorem  1:  Consider  the  optimal  collaboration  design 
problem  formulated  in  (6)  and  described  in  Figure  1, 
where  the  sensor  measurements  x„„  the  local  messages 
z„,  and  the  received  messages  at  the  FC  y„  are  given  by 
(2),  (3),  and  (4),  respectively.  When  M  =  \,  the 
optimal  solution  to  (6)  is 

C*  =  YVPU4:,l]hX^i,  (7) 

where  U.„[:,  1]  denotes  the  first  column  of  U^;  U.„  is  an 
orthonormal  matrix  obtained  by  carrying  out  the 
eigenvalue  decomposition  of  R^,  i.e.,  R^,  =  Ui,Di,Uj; 

and  7  =  1/ ^h^R^^h.  The  associated  estimation  Mean 
Square  Error  (MSE),  i.e.,  the  value  of  the  minimum 
objective  function  of  (6),  is  given  by 

Proof:  A  rigorous  proof  is  provided  in  Eang  and  Li  (in 
press). 

The  optimal  solution  (7)  has  very  important 
implications  that  we  shall  explore  in  the  following. 
Considering  the  scenario  of  independent  channels,  i.e., 
R.„  is  diagonal;  U.„  =  I  and  U.„[:,  1]  =  Ci,  where  e,- 
denotes  the  unit  column  vector  with  its  ith  entry  equal 
to  1  and  its  other  entries  equal  to  0.  Therefore  the 
optimal  collaboration  matrix  becomes 


C*  = 


yVPh^R-i 

0(/.-l)xiV 


(9) 


which  is  a  matrix  with  its  first  row  equal  to 
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and  all  other  rows  equal  to  0.  The  solution 
suggests  that  we  should  compress  the  measurements  into 
only  one  local  message  and  transmit  it  via  the  best-quality 
channel  (note  that  the  first  row  corresponds  to  the  first 
channel,  which  has  the  smallest  noise  variance  because 
the  diagonal  elements  of  are  assumed  in  an  ascending 
order)  to  the  FC.  If  the  channels  have  identical  qualities, 
then  we  can  use  any  of  them  to  send  out  the  local  message. 
Also,  by  rewriting  the  collaboration  weighting  vector 
yt/Ph^R^^  as 

jVPh^'K^  =  yVPde-^a^h' R-i 

=  yyPcje-2Re,R-i,  (10) 

where  Rej.=  is[0x^],  we  can  immediately  see  that  the 
local  message  is  exactly  the  LMMSE  estimate  R0,jR^ 
multiplied  by  a  scalar  This  means  that  when 

channels  are  independent,  LMMSE  estimation  followed 
by  an  amplification  factor  is  optimal  in  a  power- 
distortion  sense. 

We  now  investigate  the  case  where  the  channels  are 
correlated,  i.e.,  R^  is  nondiagonal.  Each  row  of  the 
optimal  collaboration  matrix  can  be  readily  expressed  as 
follows  by  combining  (7)  and  (10): 

C*[i,  :]  =U4h  l]jVPa,^R,,R;\  (11) 

where  1]  is  the  (i,  l)th  entry  of  U^,.  Therefore  the 
LMMSE  estimate  is  transmitted  by  multiple  channels 
with  different  amplification  gains  that  are  proportional 
to  l]}Li-  The  number  of  local  messages  to  be 

transmitted,  1  <  ^  <  M,  should  be  as  large  as  possible 
because  the  more  channels  employed,  the  more  diversity 
that  can  be  provided.  However,  system  complexity  will 
also  increase  as  more  channels  are  involved. 


with  the  block  diagonal  structures  of  C,  R.;„,  and 
R„.  Therefore  the  optimization  problem  (6)  be¬ 
comes 


M 

max  ^  hfcf(C,R^,,-Cf  -fR,,,r^Qh; 

{C/}  i  =  i 

M 

s.t.  ^tr(C,R„yCf)<P  (13) 

i=l 

in  which  the  power  constraint  follows  from 
M 

tr(CR,C'")=^tr(C,R,,,Cf). 

i=l 

To  use  the  theoretical  results  obtained  for  M  =  1,  we 
express  the  component  h/  C/  (C,Rm,;C/  -|-R.„,,W  C,h; 
in  (13)  as  a  function  of  hf  Cf{CiRxjCf  +  R.v,i)~^Cihi, 
which  can  be  done  by  resorting  to  the  Woodbury  identity: 

<4  -  Cf  (C,R,,,-Cf  -h  R.,;)  - 1  C,h, 

=  (<7,-2  +  hf  cf  (C,-R^,,-Cf  -h  R.,0  -  ic,h,)  -  2 .  (14) 
For  notational  convenience,  let 


p,(C0  =  hf  cf  (C,R^,,-Cf  +  R.,0  -  'C,-h,- 
Ti,.(Q)  =  hfCf(QR,,,Cf  +  R.y)-2Qh;.  (15) 


Therefore  (14)  can  be  rewritten  as 


H(C)  = 


<^0 


-1 


(16) 


Substituting  (16)  into  (13),  we  arrive  at  the  following 
optimization 


Multiple  cluster  case 

We  now  examine  a  general  scenario  where  the  network 
consists  of  multiple  sensor  clusters.  In  this  case,  the 
collaboration  matrix  C  has  a  block  diagonal  structure 
because  intercluster  collaboration  is  not  allowed.  The 
approach  described  in  previous  subsection,  therefore, 
cannot  be  directly  applied  here.  To  solve  (6),  we  hope  to 
decouple  the  optimization  problem  into  a  set  of  tractable 
subtasks.  To  this  goal,  we  rewrite  the  estimation  MSE  as 
follows  (Fang  and  Li  in  press). 

£[(0  -  0)2]  =  ct2  -  cT4h2’c2’(CR,C2’  -p  RJ-^Ch 

=  hfCf(C,R™,;Cf  +  R„,0^'C,h,'j  , 

^  {l2) 

where  we  use  the  fact  that  Rj;  =  CT0hh2’  +  R^,  along 


max 

{c,} 


1  cj2  \  l  -  a2r|,(C,) 

M 

s.t.  Etr(QR,,,Cf)<P. 

i=l 


(17) 


Clearly,  (17)  can  be  decoupled  into  two  sequential 
subtasks,  i.e.,  a  power  allocation  (among  clusters) 
problem  and  a  set  of  collaboration  matrix  design 
problems  that  can  be  solved  using  the  previous  results. 
To  see  this,  suppose  {Pj  ,  P2,  .  .  .  ,  Pjyi}  is  an  optimum 
power  assignment  with 

tr(C,R,,,Cf)<P;  V/6{1, 

M 

Y^P*<p 

i  =  l 
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then  (17)  is  simplified  into  a  set  of  identical  problems  as 


tions.  Briefly  speaking,  for  a  threshold  we  have 


1 

max 

{C,} 

s.t. 


_ 1 

tr(C,R,,,Cf)<P;. 


(18) 


Note  that  a0r|;(C,)  must  lie  within  the  interval 
(0,  1)  because  we  have  ri(C,)  >  0  and  p,(Cy)  >  0  from 
their  definitions.  Hence  (18)  is  equivalent  to 


rrmx  qi(C,) 

s.t.  tr(C,R,,,Cf)<P;,  (19) 

which  is  exactly  the  optimization  problem  discussed  in 
the  previous  section.  The  optimal  solution  to  (19)  is 
given  in  Theorem  1.  The  key  problem,  therefore, 
is  to  determine  the  optimum  power  assignment 
{PJ,  . . . ,  To  meet  this  goal,  we  need  to 

find  out  the  relationship  between  the  maximum 
objective  function  value  ri,(C*)  and  P*.  Recalling 
Theorem  1,  more  precisely,  (8),  we  have 


B,(CD  = 


P*  +  min  (d^,;) 


h 


aiP* 


(20) 


where  we  define  ot,-  =  hj  R^  /h,-,  P,-  =  min  (d„^y),  and 
is  a  column  vector  consisting  of  the  eigenvalues  of 
R^ (note  that  R^y  can  be  nondiagonal).  Substituting 
(20)  into  the  objective  function  of  (17),  we  get 


M 


E 


1 


M 


a.'P* 


f^(l-cj2a,)P*+^, 


(21) 


Clearly,  the  optimal  power  allocation  {Pj,  P|,  . . . ,  Pj^} 
must  be  the  one,  among  all  feasible  power  assignments, 
that  maximizes  (21).  Therefore,  it  can  be  found  out  by 


min  — 


a;P; 


M 

E(i_ci2a,)p.  +  p^. 


M 

s.t.  J2P‘^P 

i=l 


P,>0  Vi6{l, 


(22) 


It  is  easy  to  verify  that  the  optimization  problem  (22) 
is  convex  because  its  Hessian  matrix,  which  is  a  diagonal 
matrix  in  this  case,  is  positive  semidefmite  on  the  convex 
set  defined  by  the  linear  constraints.  Although  (22)  is 
efficiently  solvable  by  numerical  methods,  it  can  also  be 
solved  analytically  by  resorting  to  the  Lagrangian 
function  and  Karush-Kuhn-Tucker  conditions,  which 
leads  to  a  water-fiUing  type  power  allocation  scheme. 
The  details  are  omitted  here  because  of  space  limita- 


0 


otherwise 


(23) 


where  ([),■  =  a,7p,-,  (p;  =  (l  — a0Ot;)/P;.  It  is  easy  to  see 
that  each  cluster  can  decide  whether  to  transmit  or 
keep  silent  by  the  criterion  (j),-  &  X.  Note  that  (j),-  is 
the  ratio  of  hJ^R^lh,-  to  min  (d^,_,),  with  the  former 
is  a  measure  of  the  cluster’s  estimation  quality  (a 
larger  value  indicates  a  better  estimation  accuracy) 
and  the  latter  a  measure  of  the  cluster’s  channel 
quality  (a  smaller  value  indicates  a  better  channel 
quality). 

So  far  we  have  developed  an  analytical  approach  that 
leads  to  an  optimal  solution  to  (6).  For  clarity,  we  now 
summarize  the  steps  of  our  proposed  method. 


1.  Given  the  prior  knowledge  of  the  autocorrelation 

matrices  and  the  observa¬ 
tion  gain  vectors  compute  and 

{P,}fli,  where  a;  =  hJ’R^/h,'  and  P,-  =  min(d.;,_i). 

2.  Given  the  total  power  constraint  P,  find  the 
optimal  power  allocation  among  clusters  via  (22). 

3.  With  the  optimal  power  assignment  {Pj, 
P2,  . . . ,  P^}  derived  in  the  previous  step,  deter¬ 
mine  the  optimal  collaboration  matrices  {C,}^j 
via  (19),  whose  solution  is  detailed  in  Theorem  1. 


Simulation  results 

We  consider  the  single  cluster  case  and  carry  out  a 
simple  performance  analysis  to  corroborate  our  theo¬ 
retical  results  (more  analysis  and  simulation  results  are 
available  in  Fang  and  Li  [in  press]).  We  compare  our 
optimal  collaboration  strategy  with  the  scheme  pro¬ 
posed  in  Cui  et  al.  (2007),  where  there  is  no  intersensor 
collaboration  and  each  sensor  transmits  its  observation 
to  the  FC  with  optimally  assigned  power.  For 
simplicity,  we  consider  a  homogeneous  environment 
with  identical  observation  and  channel  qualities,  where 
denotes  the  observation  noise  variance  and 
represents  the  channel  noise  variance.  Also,  all 
observation  and  channel  gains  are  assumed  to  be 
unitary,  i.e.,  =1,  throughout  all  examples  in  the  article. 
Clearly,  an  equal  power  allocation  is  optimum  for  Cui 
et  al.  (2007)  and  the  corresponding  estimation  MSE 
can  be  shown  to  be 


MSEnc 


Pajal+Nalcyt+Nalalal 
PNal+Pdl^  +NgI(iI+N<jI<jI^  ’ 


(24) 


where  the  subscript  NC  denotes  noncoUaboration.  For 
our  collaboration  strategy,  the  estimation  MSE  can  be 
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Figure  2.  MSEs  of  collaborative  and  noncollaborative  strategies  versus  number  of  sensors  (tI  =  0.1,  nl  =  l,  and  P  =  1. 


computed  by  using  (8),  which  reduces  to 


MSEoc 


PNa]  +  Pal+Nalr^l  +  ’ 


(25) 


where  the  subscript  OC  denotes  optimal  coUahoration. 
For  notational  convenience,  let  a  =  Pct^ag  +7Va^a'g  and 
b  =  PN CTg  +  CTg .  It  can  he  easily  verified  that 


{a  +  N(jl(jl(7l){b  +  (jl(Tl) 

>  (a  +  (Tlaj,(Tl)ib  +  Ncrlcrl),  (26) 

where  (26)  becomes  an  equality  only  when  N  =  1. 
Hence  as  expected,  the  relationship  MSENC  s 
MSEOC  holds,  which  means  that  the  optimal  collab¬ 
oration  scheme  should  always  outperform  the  noncoUa- 
boration  scheme. 

Figure  2  depicts  the  estimation  MSEs  of  the  two 
schemes  as  a  function  of  N  under  a  total  transmit  power 
constraint,  with  a^  =  0.2  and  a^  =  l,  respectively. 
Erom  Figure  2,  we  see  that  both  schemes  benefit  from 
an  increasing  number  of  sensors;  as  N  increases,  the 
estimation  MSEs  wiU  asymptotically  approach  certain 
values  that,  however,  are  nonzero.  This  observation  can 
be  readily  verified  from  (24)-(25).  Also,  it  can  be  seen 
that  the  noncollaborative  scheme  is  sensitive  to  the  value 
of  as  the  observation  quality  deteriorates,  its 
performance  degrades  considerably.  In  contrast,  the 


collaborative  strategy  demonstrates  a  certain  degree  of 
robustness  against  the  observation  quality  deterioration. 
In  Figure  3,  we  plot  the  estimation  MSE  versus  the  total 
transmit  power.  We  see  that  the  performance  gap 
between  the  two  strategies  shrinks  as  the  transmit  power 
increases.  In  fact,  from  (24)-(25)  we  observe  that  as  the 
transmit  power  goes  to  infinity,  these  two  strategies 
approach  identical  performance.  This  suggests  that  the 
collaborative  strategy  should  be  preferred  especially 
when  the  sensor  observation  qualities  are  bad  and 
transmit  power  is  severely  constrained. 

Conclusion 

We  studied  an  optimal  collaboration  and  power 
allocation  problem  for  distributed  estimation  in  a 
power-constrained  collaborative  sensor  network,  where 
the  network  consists  of  a  number  of  sensor  clusters, 
and  collaboration  is  allowed  within  the  same  cluster 
but  not  across  clusters.  Our  theoretical  results  showed 
that,  given  a  specified  total  transmit  power,  the  power 
should  be  assigned  among  the  clusters  in  a  water-fiUing 
manner,  with  each  cluster  deciding  whether  to  transmit 
or  keep  silent  by  comparing  with  a  threshold  the  ratio 
of  a  measure  of  the  cluster’s  estimation  quality  to  a 
measure  of  the  cluster’s  channel  quality.  Also,  for  each 
cluster,  if  the  channels  from  this  cluster  to  the  EC  are 
independent,  then  an  optimal  collaboration  yields  only 
one  local  message,  which  is  sent  from  the  best  channel 
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Figure  3.  MSEs  of  collaborative  and  noncollaborative  strategies  versus  total  transmit  power.  al  =  0.1,  al  =  I,  and  N  =  50. 


within  the  cluster  to  the  FC;  otherwise  the  local 
message  has  to  be  sent  across  all  channels  within  the 
cluster  at  different  power  levels  matched  to  their 
channel  quality.  Specifically,  in  either  case,  the 
compressed  local  message  is  exactly  the  local  LMMSE 
estimate  multiplied  by  an  amplification  factor.  Simu¬ 
lation  results  have  been  presented  to  corroborate  our 
theoretical  analysis.  □ 
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